SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Celeste, Anthony J. 

Wozney, John 
Rosen, Vicki A. 
Wolf man, Neil. 
Thomsen, Gerald H. 
Melton, Douglas A. 

(ii) TITLE OF INVENTION: TENDON- INDUCING COMPOSITIONS 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: GENETICS INSTITUTE, INC. 

(B) STREET: 87 "Cambridge Park Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP : 02140 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

f 5 * (D) SOFTWARE : Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 
^ (B) FILING DATE: Herewith 

S_ (C) CLASSIFICATION: 

m (viii) ATTORNEY/AGENT INFORMATION: 
K (A) NAME: Lazar, Steven R. 

Ul (B) REGISTRATION NUMBER: 32,618 

' (C) REFERENCE /DOCKET NUMBER: 5202 -D 

fi (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617 498-8260 

(B) TELEFAX: 617 876-5851 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 
.(B) CLONE: vl-1 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 571.. 882 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
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# 



(B) LOCATION: 1. . 882 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GCG CGT AAT ACG ACT CAC TAT AGG GCG AAT TGG GTA CGG GGC CCA GGC 4 8 

Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 
-190 -185 -180 -175 

AGC TGG ACT TCT CCG CCG TTG CTG CTG CTG TCC ACG TGC CCG GGC GCC 96 
Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 
-170 -165 -160 

GCC CGA GCG CCA CGC CTG CTG TAC TCG CGG GCA GCT GAG CCC CTA GTC 14 4 

Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 
-155 -150 -145 

GGT CAG CGC TGG GAG GCG TTC GAC GTG GCG GAC GCC ATG AGG CGC CAC 192 
Gly Gin Arg Trp Glu Ala ?he Asp Val Ala Asp Ala Met Arg Arg His 
-140 -135 -130 

CGT CGT GAA CCG CGC CCC CCC CGC GCG TTC TGC CTC TTG CTG CGC GCA 240 
Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 
^ -125 -120 -115 

GTG GCA GGC CCG GTG CCG AGC CCG TTG GCA CTG CGG CGA CTG GGC TTC 288 
£ Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 
|r -110 -105 -100 -95 

P 1 GGC TGG CCG GGC GGfi^GGG GGC TCT GCG GCA GAG GAG CGC GCG GTG CTA 336 
S3 Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 
jy -90 -85 -80 

* GTC GTC TCC TCC CGC ACG CAG AGG AAA GAG AGC TTA TTC CGG GAG ATC 384 
p Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu He 
-75 -70 -65 

W CGC GCC CAG GCC CGC GCG CTC GGG GCC GCT CTG GCC TCA GAG CCG CTG 4 32 

If* Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 

jf** -60 -55 -50 

M 1 CCC GAC CCA GGA ACC GGC ACC GCG TCG CCA AGG GCA GTC ATT GGC GGC 480 
Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val He Gly Gly 
-45 -40 -35 

* 

CGC AGA CGG AGG AGG ACG GCG TTG GCC GGG ACG CGG ACA GCG CAG GGC 52 8 

Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 
-30 -25 -20 -15 

AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC AGG GGC CGG AGC CGC 5 76 

Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 
-10 -5 1 

TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG GAG CTC GGC TGG GAC 6 24 

Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 
5 10 15 

GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG TAC CAC TGC GAG GGC 6 72 

Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly 
20 25 30 

CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC AAC CAT GCC 720 
Leu Cvs Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 
35 40 45 •- 50 
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ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA GAC GCG GCG CCG GCC 768 
He He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 
55 60 65 

TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC AGC ATC CTC TAC ATC 816 
Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser He Leu Tyr He 
70 75 80 

GAC GCC GCC AAC AAC GTT GTC TAC. AAQ CAA : TAC . : GAG. .GAC ATG GTG GTG 864 
Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 
85 90 95 

GAG GCC TGC GGC TGC AGG . TAGCGCGCGG GCCGGGGAGG GGGCAGCCAC 912 
Glu Ala Cys Gly Cys Arg 
100 

GCGGCCGAGG ATCC ' 926 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
if** (A) LENGTH: 294 amino acids 

^ (B) TYPE: amino acid 

*B (d) TOPOLOGY: linear 

Jz (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



m Ala Arg Asn Thr Thr His Tyr Arg Ala Asn Trp Val Arg Gly Pro Gly 

-190 -185 -180 -175 

a Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala 
m -170 - -165 -160 

W Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val 
iyj -155 -150 -145 

:U Gly Gin Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His 
Jsp? -140 -135 -130 

Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala 
-125 -120 -115 

Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe 
-110 -105 -100 -95 

Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu 
-90 -85 -80 

Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu He 
-75 -70 -65 

Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu 
-60 -55 -50 

Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly 
-45 -40 -35 

Arg Ara Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr Ala Gin Gly 
-30 " -25 -20 -15 

Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg 
-10 -5 1 
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Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp 
5 10 15 

Asp Trp He He Ala Pro Leu Asp Tyr Glu Ala Tyr- His Cys Glu Gly 
20 25 30 

Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala 
35 40 45 50 

He He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala 
55 60 65 

Ser Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser He Leu Tyr He 
70 75 80 

Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val 
85 90 95 

Glu Ala Cys Gly Cys Arg 
100 

(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 
i} (A) LENGTH: 1207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

jfU (vi) ORIGINAL SOURCE: 

g " (A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 
S3 (B) CLONE: MP52 

w 

ruX (ix) FEATURE: 

r" (A) NAME/KEY: CDS 

O (B) LOCATION: 845.. 1204 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ACCGGGCGGC CCTGAACCCA AGCCAGGACA CCCTCCCCAA ACAAGGCAGG CTACAGCCCG cD 
GACTGTGACC CCAAAAGGAC AGCTTCCCGG AGGCAAGGCA CCCCCAAAAG CAGGATCTGT 12 0 

CCCCAGCTCC TTCCTGCTGA AGAAGGCCAG GGAGCCCGGG CCCCCACGAG AGCCCAAGGA 130 
GCCGTTTCGC CCACCCCCCA TCACACCCCA CGAGTACATG CTCTCGCTGT ACAGGACGCT 24 0 

GTCCGATGCT GACAGAAAGG GAGGCAACAG CAGCGTGAAG TTGGAGGCTG GCCTGGCCAA 3 00 

CACCATCACC AGCTTTATTG ACAAAGGGCA AGATGACCGA GGTCCCGTGG TCAGGAAGCA 36 0 

GAGGTACGTG TTTGACATTA GTGCCCTGGA GAAGGATGGG CTGCTGGGGG CCGAGCTCCG -.2 0 

GATCTTGCGG AAGAAGCCCT CGGACACGGC CAAGCCAGCG GCCCCCGGAG GCGGGCGGGC 
TGCCCAGCTG AAGCTGTCCA GCTGCCCCAG CGGCCGGCAG CCGGCCTCCT TGCTGGATGT 
GCGCTCCGTG CCAGGCCTGG ACGGATCTGG CTGGGAGGTG TTCGACATCT GGAAGCTCTT 
CCGAAACTTT AAGAACTCGG CCCAGCTGTG CCTGGAGCTG GAGGCCTGGG AACGGGGCAG 
GGCCGTGGAC CTCCGTGGCC TGGGCTTCGA CCGCGCCGCC CGGCAGGTCC ACGAGAAGGC 
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CCTGTTCCTG GTGTTTGGCC GCACCAAGAA ACGGGACCTG TTCTTTAATG AGATTAAGGC 780 

CCGCTCTGGC CAGGACGATA AGACCGTGTA TGAGTACCTG TTCAGCCAGC GGCGAAAACG 840 

GCGG GCC CCA CTG GCC ACT CGC CAG GGC AAG CGA CCC AGC AAG AAC CTT 889 
Ala Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu 
1 5 10 15 



AAG GCT CGC TGC AGT CGG AAG GCA CTG " CAT * GTC AAC TTC AAG GAC ATG 937 
Lys Ala Arg Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met 
20 25 30 

GGC TGG GAC GAC TGG ATC 'ATC GCA CCC CTT GAG TAC GAG GCT TTC CAC 985 
Gly Trp Asp Asp Trp lie lie Ala Pro Leu Glu Tyr Glu Ala Phe His 
35 40 45 

TGC GAG GGG CTG TGC GAG TTC CCA TTG CGC TCC CAC CTG GAG CCC ACG 1033 
Cys Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
50 55 60 

AAT CAT GCA GTC ATC CAG ACC CTG ATG AAC TCC ATG GAC CCC GAG TCC 1081 
Asn His Ala Val lie Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser 
65 70 75 

ACA CCA CCC ACC TGC TGT GTG CCC ACG CGG CTG AGT CCC ATC AGC ATC 112 9 

Thr Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser Pro lie Ser lie 

80 85 90 95 

ill 

|M> CTC TTC ATT GAC TCT^jSCC AAC AAC GTG GTG TAT AAG CAG TAT GAG GAC 117 7 

m Leu Phe lie Asp Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp 

St 100 105 110 



3 ATG GTC GTG GAG TCG TGT GGC TGC AGG TAG 1207 
ir* Met Val Val Glu Ser Cys Gly Cys Arg 
2£ 115 120 

yj 

y 

y, (2) INFORMATION FOR SEQ ID NO:4: 

;M ! (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Ala Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys Asn Leu Lys 
15 10 15 

Ala Arg Cys Ser Arg Lys Ala Leu His Val Asn Phe Lys Asp Met Gly 
20 25 30 

Trp Asp Asp Trp lie lie Ala Pro Leu Glu Tyr Glu Ala Phe His Cys 
35 40 45 

Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn 
50 55 60 

His Ala Val lie Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr 
65 70 75 80 

Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser Pro lie Ser lie Leu 
85 90 95 
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Phe lie Asp Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met 
100 105 110 

Val Val Glu Ser Cys Gly Cys Arg 
115 120 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 'linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: £omo Sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VI -1 fragment 

p (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 28 . . 102 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

jjj GGATCCTGGA AGGATTGGAT CATTGCG CCG CTG GAC TAC GAG GCG TAC CAC 51 

69 Pro Leu Asp Tyr Glu Ala Tyr His 

fy is- 

JiL TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC 99 

y Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 

m 10 15 20 

^ AAC CACGCTATAG TCCAAACCTT TCTAGA 12 8 

iM Asn 
fl 25 

3,-5.. 

(2) INFORMATION FOR SEQ ID NO : 6 : 

( i ) SEQUENCE CHARACTERISTICS : ' 

(A) LENGTH: 2 5 amino acids * 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Leu Cys Asp Phe Pro 
15 10 15 

Leu Arg Ser His Leu Glu Pro Thr Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE : DNA ( genomi c ) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: VL-1 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 28 . . 102 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GGATCCTGGG ATGACTGGAT TATGGCG CCG CTG GAC TAC GAG GCG TAC CAC 51 

Pro Leu Asp Tyr Glu Ala Tyr His 
1 5 

TGC GAG GGT GTA TGC GAC TTC CCG CTG CGC TCG CAC CTG GAG CCC ACC . 99 
Cys Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
10 15 20 



. & AAC CACGCCATGC TACAAACGCT TCTAGA 128 

*U _ 

1=. Asn 
25 



j** (2) INFORMATION FOR J^EQ ID NO : 8 : 
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(xi) SEQUENCE DESCRIPTION. : SEQ ID NO : 8 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
3 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

f% Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
:pf 1 5 10 15 

Leu Arg Ser His Leu Glu Pro Thr Asn 

20 25 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE .SOURCE: 

(B) CLONE: pALVl-781 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CTAACTACCC AACTCAAAAA AAAAAAAAAA AAAAACCCCC TCTAACCCCC ATTGACGAAA 6 0 

GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC 12 0 

GTC.kGGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT 180 
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• 



ACATTCAAAT 


ATGTATCCGC 


TCATGAGACA 


ATAACCCTGA 


TAAATGCTTC 


AATAATATTG 


240 


AAAAAGGAAG 


AGTATGAGTA 


TTCAACATTT 


CCGTGTCGCC 


CTTATTCCCT 


TTTTTGCGGC 


300 


ATTTTGCCTT 


CCTGTTTTTG 


CTCACCCAGA 


AACGCTGGTG 


AAAGTAAAAG 


ATGCTGAAGA 


360 


TCAGTTGGGT 


GCACGAGTGG 


GTTACATCGA 


ACTGGATCTC 


AACAGCGGTA 


AGATCCTTGA 


420 


GAGTTTTCGC 


CCCGAAGAAC 


GTTTTCCAAT 


GATGAGCACT 1 


TTTAAAGTTC 


TGCTATGTGG 


480 


CGCGGTATTA 


TCCCGTATTG 


ACGCCGGGCA 


AGAGCAACTC 


GGTCGCCGCA 


TACACTATTC 


540 


TCAGAATGAC 


TTGGTTGAGT 


ACTCACCAGT 


CACAGAAAAG 


CATCTTACGG 


ATGGCATGAC 


600 


AGTAAGAGAA 


TTATGCAGTG 


CTGCCATAAC 


CATGAGTGAT 


AACACTGCGG 


CCAACTTACT 


660 


TCTGACAACG 


ATCGGAGGAC 


CGAAGGAGCT 


AACCGCTTTT 


TTGCACAACA 


TGGGGGATCA 


720 


TGTAACTCGC 


CTTGATCGTT 


GGGAACCGGA 


GCTGAATGAA 


GCCATACCAA 


ACGACGAGCG 


780 


TGACACCACG 


ATGCCTGTAG 


CAATGGCAAC 


AACGTTGCGC 


AAACTATTAA 


CTGGCGAACT 


840 


Q ACTTACTCTA 


GCTTCCCGGC 


AACAATTAAT 


AGACTGGATG 


GAGGCGGATA 


AAGTTGCAGG 


900 


^ ACCACTTCTG 


CGCTCGGCCC 


TTCCGGCTGG 


CTGGTTTATT 


GCTGATAAAT 


CTGGAGCCGG 


960 


^ TG AG CGTGGG 


TCTCGCGGTA 


TCATTGCAGC 


ACTGGGGCCA 


GATGGTAAGC 


CCTCCCGTAT 


1020 


III 

" = CGTAGTTATC 


TACACGACCJG- 


GGAGTCAGGC 


AACTATGGAT 


GAACGAAATA 


GACAGATCGC 


1080 


TGAGATAGGT 


GCCTCACTGA 


TTAAGCATTG 


GTAACTGTCA 


GACCAAGTTT 


ACTCATATAT 


1140 


i » ACTTXAG ATT 
p TGATAATCTC 


GATTTAAAAC 


TTCATTTTTA 


ATTTAAAAGG 


ATCTAGGTGA 


AGATCCTTTT 


1200 


ATGACCAAAA 


TCCCTTAACG 


TGAGTTTTCG 


TTCCACTGAG 


CGTCAGACCC 


1260 


P? CGTAGAAAAG 

Ui- 


ATCAAAGGAT 


CTTCTTGAGA 


TCCTTTTTTT 


CTGCGCGTAA 


TCTGCTGCTT 


1320 


M, GCAAACAAAA 


AAACCACCGC 


TACCAGCGGT 


GGTTTGTTTG 


CCGGATCAAG 


AGCTACCAAC 


1380 


Hi 

™" TCTTTTTCCG 
GTAGCCGTAG 


AAGGTAACTG 


GCTTCAGCAG 


AGCGCAGATA 


CCAAATACTG 


TCCTTCTAGT 


1440 


TTAGGCCACC 


ACTTCAAGAA 


CTCTGTAGCA 


CCGCCTACAT 


ACCTCGCTCT 


1500 


GCTAATCCTG 


TTACCAGTGG 


CTGCTGCCAG 


TGGCGATAAG 


TCGTGTCTTA 


CCGGGTTGGA 


156-0 


CTCAAGACGA 


TAGTTACCGG 


ATAAGGCGCA 


GCGGTCGGGC 


TGAACGGGGG 


GTTCGTGCAC 


1620 


ACAGCCCAGC 


TTGGAGCGAA 


CGACCTACAC 


CGAACTGAGA 


TACCTACAGC 


GTGAGCATTG 


1680 


AGAAAGCGCC 


ACGCTTCCCG 


AAGGGAGAAA 


GGCGGACAGG 


TATCCGGTAA 


GCGGCAGGGT 


1740 


CGGAACAGGA 


GAGCGCACGA 


GGGAGCTTCC 


AGGGGGAAAC 


GCCTGGTATC 


TTTATAGTCC 


1800 


TGTCGGGTTT 


CGCCACCTCT 


GACTTGAGCG 


TCGATTTTTG 


TGATGCTCGT 


CAGGGGGGCG 


1860 


GAGCCTATGG 


AAAAACGCCA 


GCAACGCGGC 


CTTTTTACGG 


TTCCTGGCCT 


TTTGCTGGCC 


1920 


TTTTGCTCAC 


ATGTTCTTTC 


CTGCGTTATC 


CCCTGATTCT 


GTGGATAACC 


GTATTACCGC 


1980 


CTTTGAGTGA 


GCTGATACCG 


CTCGCCGCAG 


CCGAACGACC 


GAGCGCAGCG 


AGTCAGTGAG 


2 04 0 


CGAGGAAGCG 


GAAGAGCGCC 


CAATACGCAA 


ACCGCCTCTC 


CCCGCGCGTT 


GGCCGATTCA 


2100 


TTAATGCAGA 


ATTGATCTCT 


CACCTACCAA 


ACAATGCCCC 


CCTGCAAAAA 


ATAAATTCAT 


2160 


ATAAAAAACA 


TACAGATAAC 


CATCTGCGGT 


GATAAATTAT 


CTCTGGCGGT 


GTTGACATAA 


2220 
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ATACCACTGG 


CGGTGATACT 


GAGCACATCA 


GCAGGACGCA 


CTGACCACCA 


TGAAGGTGAC 


2280 




GCTCTTAAAA 


ATTAAGCCCT 


GAAGAAGGGC 


AG CATTCAAA 


GCAGAAGGCT 


TTGGGGTGTG 


2340 




TGATACGAAA 


CGAAGCATTG 


GCCGTAAGTG 


CGATTCCGGA 


TTAGCTGCCA 


ATGTGCCAAT 


2400 




CGCGGGGGGT 


TTTCGTTCAG 


GACTACAACT 


GCCACACACC 


ACCAAAGCTA 


ACTGACAGGA 


2460 




GAATCCAGAT 


GGATGCACAA ACACGCCGCC GCGAACGTCG CGCAGAGAAA CAGGCTCAAT 


2520 




GGAAAGCAGC 


AAATPPPPTG 


TTGGTTGGGG 

X X W\J X X- X-* 


TAAGCGCAAA 


ACCAGTTCCG 


AAAGATTTTT 


2580 




TTAACTATAA 


a nn era a Tacz 


AAGPGTTTAT 


GCGGAAGAGG 


TAAAGCCCTT 


CCCGAGTAAC 


2640 




AAAAAAACAA 


pa^ipata aat 


AAPPPPGCTC 


TTACACATTC 


CAGCCCTGAA 


AAAGGGCATC 


2700 




AAATTAAACC 


a p a ppt a *rnn 


TfiT ATf? P ATT 


TATTTGCATA 


CATTCAATCA 


ATTGTTATCT 


2760 




AAGGAAATAC 


X X InlVJl 


P T PG TTGTT C 


TCGTAAACCA 


CTGCATGTAG 


ATTTTAAAGA 


2820 




GCTCGGCTGG 




TPATPGPGPC 


GCTGGACTAC 


GAGGCGTACC 


ACTGCGAGGG 


2880 


%4 


CCTTTGCGAC 


X lv-LL 1 X XOv> 


GTTPGPAPPT 


CGAGCCCACC 


AACCATGCCA 


TCATTCAGAC 


2940 




GCTGCTCAAC 


TCCATGGCAC 


CAGACGCGGC 


GCCGGCCTCC 


TGCTGTGTGC 


CAGCGCGCCT 


3000 




CAGCCCCATC 


AGCATCCTCT 


ACATCGACGC 


CGCCAACAAC 


GTTGTCTACA 


AGCAATACGA 


3060 




GGACATGGTG 


GTGGAGGCPT 


GCGGCTGCAG 


GTAGTCTAGA 


GTCGACCTGC 


AGTAATCGTA 


3120 


a : s 


CAGGGTAGTA 


CAAATAAAAA 


AGGCACGTCA 


GATGACGTGC 


CTTTTTTCTT 


GTGAGCAGTA 


3180 




AGCTTGGCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


CGTTACCCAA 


3240 




CTTAATCGCC 


TTGCAGCACA 


TCCCCCTTTC 


GCCAGCTGGC 


GTAATAGCGA 


AGAGGCCCGC 


3 3.00 




ACCGATCGCC 


CTTCCCAACA 


GTTGCGCAGC 


CTGAATGGCG 


AATGGCGCCT 


GATGCGGTAT 


3360 




TTTCTCCTTA 


CGCATCTGTG 


CGGTATTTCA 


CACCGCATAT 


ATGGTGCACT 


CTCAGTACAA 


3420 


O 


TCTGCTCTGA 


TGCCGCATAG 


TTAAGCCAGC 


CCCGACACCC 


GCCAACACCC 


GCTGACGCGC 


3480 




CCTGACGGGC 


TTGTCTGCTC 


CCGGCATCCG 


CTTACAGACA 


AGCTGTGACC 


GTCTCCGGGA 


3540 




GCTGCATGTG 


TCAG&GGTTT 


TCACCGTCAT 


CACCGAAACG 


CGCGA 
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(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mVl 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 28 . .243 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATC ATC GCG CCA TTA GAC TAC 51 

Trp lie He Ala Pro Leu Asp Tyr 
1 5 

GAG GCA TAC CAC TGC GAG GGC GTT TGC GAC TTT CCT CTG CGC TCG CAC 99 
Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His 
10 15 1 20 " : 

CTG GAG CCT ACC AAC CAC GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG 14 7 

Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu Leu Asn Ser Met 
25 30- 35 40 

GCG CCC GAC GCT GCG CCA GCC TCC TGC TGC GTG CCC GCA AGG CTC AGT 195 
Ala Pro Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser 
45 50 55 

CCC ATC AGC ATT CTC TAC ATC GAT GCC GCC AAC AAC GTG GTC TAC AAG 24 3 

Pro He Ser He Leu Tyr He Asp Ala Ala Asn Asn Val Val Tyr Lys 
60 65 70 

O CAATACGAGG ACATGGTGGT GGGGAATTC 2 72 



^ (2) INFORMATION FOR SEQ ID NO: 11: 



fJl (i) SEQUENCE CHARACTERISTICS: 
y^" (A) LENGTH: 72 amino acids 

^ (B) TY^E: amino acid 

Ui (D) TOPOLOGY: linear 

m 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CB 

Trp He lie Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Val 
1 5 10 15 

SjSKSi 

jQ Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He 
s_i 20 25 30 

He Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser 
35 40 45 

Cys Cys Val Pro Ala Arg Leu Ser Pro He Ser lie Leu Tyr lie Asp 
50 55 60 

Ala Ala Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mV2 
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# 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 28.. 243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATT ATC GCG CCC CTA GAG TAC .51 

Trp'Ile lie Ala Pro lieu Glu Tyr 
1 5 

GAG GCC TAT CAC TGC GAG GGC GTG TGC GAC TTT CCG CTG CGC TCG CAC 99 
Glu Ala Tyr His Cys Glu'Gly Val Cys Asp Phe Pro Leu Arg Ser His 
10 15 20 

CTT GAG CCC ACT AAC CAT GCC ATC ATT CAG ACG CTG ATG AAC TCC ATG 14 7 

Leu Glu Pro Thr Asn His Ala He He Gin Thr Leu Met Asn Ser Met 
25 30 c- 35 40 

GAC CCG GGC TCC ACC CCG CCT AGC TGC TGC GTT CCC ACC AAA CTG ACT 195 
Asp Pro Gly Ser Thr Pro Pro Ser Cys Cys Val Pro Thr Lys Leu Thr 
45 50 55 

Q 

•ft CCC ATT AGC ATC CTG TAC ATC GAC GCG GGC AAT AAT GTA GTC TAC AAG 24 3 

'% Pro He Ser He Leu Tyr He Asp Ala Gly Asn Asn Val Val Tyr Lys 
S 60 65 70 

yfjj CAATACGAGG ACATGGTGGT GGGGAATTC 272 

t 

CS (2) INFORMATION FOR SEQ ID NO: 13: 

ru 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 72 amino acids 

fci (B) TYPE: amino acid 

fn (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Trp He He Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu Gly Val 
1 5 10 15 

Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He 
20 25 30 

He Gin Thr Leu Met Asn Ser Met Asp Pro Gly Ser Thr Pro Pro Ser 
35 40 45 

Cys Cys Val Pro Thr Lys Leu Thr Pro He Ser He Leu Tyr He Asp 
50 55 60 

Ala Gly Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: mouse 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: mV9 

(ix) FEATURE: 

(A) NAME /KEY : CDS . . 

(B) LOCATION: 28.. 243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

GGATCCAAGG AGCTCGGCTG GGACGAC TGG ATC ATC GCA CCT CTT GAG TAT 51 

Trp lie lie Ala Pro Leu Glu Tyr 
1 5 

GAG GCC TTC CAC TGC GAA GSA CTG TGT GAG TTC CCC TTG CGC TCC CAC 99 
Glu Ala Phe His Cys Glu Gly Leu Cys Glu Phe Pro Leu Arg Ser His 
10 15 20 

TTG GAG CCC ACA AAC CAC GCA GTC ATT CAG ACC CTA ATG AAC TCT ATG 14 7 

O Leu Glu Pro Thr Asn His Ala Val Ile Gln Thr Leu Met Asn Ser Met 

: fS 25 30 35 40 

^ GAC CCT GAA TCC ACA CCA CCC ACT TGT TGT GTG CCT ACA CGG CTG AGT 195 
:sfS Asp Pro Glu Ser Thr Pro Pro Thr Cys Cys Val Pro Thr Arg Leu Ser 
Iff 45 50 55 

CCT ATT AGC ATC CTC TTC ATC GAC TCT GCC AAC AAC GTG GTG TAT AAA 24 3 

fil Pro Ile Ser Ile Leu Phe Ile Asp Ser Ala Asn Asn Val Val Tyr Lys 

jFJj 60 65 70 

!L CAATACGAGG ACATGGTGGT GGGGAATTC 272 

u 

g""s (2) INFORMATION FOR SEQ ID NO: 15: 

H (i) SEQUENCE CHARACTERISTICS: 

iQ (A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Trp lie Ile Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu Gly Leu ' 
1 5 10 15 

Cys Glu Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala Val 
20 25 30 

Ile Gin Thr Leu Met Asn Ser Met Asp Pro Glu Ser Thr Pro Pro Thr 
35 40 45 

Cys Cys Val Pro Thr Arg Leu Ser Pro Ile Ser Ile Leu Phe Ile Asp 
50 55 60 

Ser Ala Asn Asn Val Val Tyr Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: BMP/TGF-beta consensus sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Trp Xaa Asp Trp lie Xaa Ala 
1 5 ' 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 .tease pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
!fll CGGATCCTGG VANGAYTGGA THRTNGC 
1=, (2) INFORMATION FOR SEQ ID. NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: BMP/TGF-beta consensus sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Ala He Xaa Gin Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



•vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide 82 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTCTAGAAR NGTYTGNACD ATNGCRTG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

CCACTGCGAG GGCCTTTGCG ACTTCCCTTT GCGTTCGCAC 

afl (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
4= (A) LENGTH: 2 9 base pairs 

yi (B) TYPE: nucleic acid 

«V (C) STRANDEDNESS: single 

if (D) TOPOLOGY: linear 

m 

Jtj (ii) MOLECULE TYPE: DNA (genomic) 



P (vii)' IMMEDIATE SOURCE : 

CO (A) LIBRARY: oligonucleotide #4 

y 

!N* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

IL^ TGCGGATCCA GCCGCTGCAG CCGCAAGCC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide US 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GACTCTAGAC TACCTGCAGC CGCAGGCCT 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: oligonucleotide #6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCGGATCCAA GGAGCTCGGC TGGGACGA 28 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: oligonucleotide #7 



in 

m 

; U (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

93 GGAATTCCCC ACCACCATGT CCTCGTAT 28 

ru 

L. (2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



!p (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: ' 
(B) CLONE: Human VL-1 protein 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 2 . . 964 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 605.. 964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

G AAT TCG GAT CTC TCG CAC ACT CCT CTC CGG AGA CAG AAG TAT TTG 4 6 

Asn Ser Asp Leu Ser His Thr Pro Leu Arg Arg Gin Lys Tyr Leu 
-201-200 -195 -190 

TTT GAT GTG TCC ATG CTC TCA GAC AAA GAA GAG CTG GTG GGC GCG GAG 94 
Phe Asp Val Ser Met Leu Ser Asp Lys Glu Glu Leu Val Gly Ala Glu 
-185 -180 -175 
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CTG 


CGG 


CTC 


TTT 


CGC 


CAG 


GCG 


CCC 


TCA 


GCG 


CCC 


TGG 


GGG 


CCA 


CCA 


GCC 


142 




Leu 


Arg 

) 


Leu 


Phe 


Arg 


Gin 


Ala 


Pro 


Ser 


Ala 


Pro 


Trp 


Gly 


Pro 


Pro 


Ala 






-17C 








-165 








-160 








-155 






GGG 


CCG 


CTC 


CAC 


GTG 


CAG 


CTC 


TTC 


CCT 


TGC 


CTT 


TCG 


CCC 


CTA 


CTG 


CTG 


190 




Gly 


Pro 


Leu 


His 


Val 


Gin 


Leu 


Phe 


Pro 


Cys 


Leu 


Ser 


Pro 


Leu 


Leu 


Leu 












-150 








-145 








-140 






GAC 


GCG 


CGG 


ACC 


CTG 


GAC 


CCG 


CAG GGG - 


GCG 


CCG 


CCG" 


GCC 


GGC 


TGG 


GAA 


238 




Asp 


Ala 


Arg 


Thr 


Leu 


Asp 


Pro 


Gin 


Gly 


Ala 


Pro 


Pro 


Ala 


Gly 


Trp 


Glu 












-135 








-130 








-125 








GTC 


TTC 


GAC 


GTG 


TGG 


CAG' 


GGC 


CTG 


CGC 


CAC 


CAG 


CCC 


TGG 


AAG 


CAG 


CTG 


286 




Val 


Phe 


Asp 


Val 


Trp 


Gin 


Gly 


Leu 


Arg 


His 


Gin 


Pro 


Trp 


Lys 


Gin 


Leu 










-120 








-115 








-110 










TGC 


TTG 


GAG 


CTG 


CGG 


GCC 


GCA 


TGG 


GGC 


GAG 


CTG 


GAC 


GCC 


GGG 


GAG 


GCC 


334 




Cys 


Leu 


Glu 


Leu 


Arg 


Ala 


Ala 


Trp 


Gly 


Glu 


Leu 


Asp 


Ala 


Gly 


Glu 


Ala 








-105 








-100 








-95 














GAG 


GCG 


CGC 


GCG 


CGG 


GGA 


CCC 


CAG 


CAA 


CCG 


CCG 


CCC 


CCG 


GAC 


CTG 


CGG 


382 




Glu 


Ala 


Arq 


Ala 


Arg 


Gly 


Pro 


Gin 


Gin 


Pro 


Pro 


Pro 


Pro 


Asp 


Leu 


Arg 






-90 










-85 










-80 










-75 










































AGT 


CTG 


GGC 


TTC 


GGC 


CGG 


AGG 


GTG 


CGG 


CCT 


CCC 


CAG 


GAG 


CGG 


GCC 


CTG 


430 


"% 


Ser 


Leu 


Gly 


Phe 


Gly 


Arg 


Arg 


Val 


Arg 


Pro 


Pro 


Gin 


Glu 


Arg 


Ala 


Leu 




in 










-70 










-65 










-60 








CTG 


GTG 


GTA 


TTC 


ACQ^AGA 


TCC 


CAG 


CGC 


AAG 


AAC 


CTG 


TTC 


GCA 


GAG 


ATG 


478 




Leu 


Val 


Val 


Phe 


Thr 


Arg 


Ser 


Gin 


Arg 


Lys 


Asn 


Leu 


Phe 


Ala 


Glu 


Met 




cn 








-55 










-50 










-45 








at. 


CGC 


GAG 


CAG 


CTG 


GGC 


TCG 


GCC 


GAG 


GCT 


GCG 


GGC 


CCG 


GGC 


GCG 


GGC 


GCC 


526 




Arg 


Glu 


Gin 


Leu 


Gly 


Ser 


Ala 


Glu 


Ala 


Ala 


Gly 


Pro 


Gly 


Ala 


Gly Ala 




Co 






-40 










-35 










-30 










y 


GAG 


GGG 


TCG 


TGG 


CCG 


CCG 


CCG 


TCG 


GGC 


GCC 


CCG 


GAT 


GCC 


AGG 


CCT 


TGG 


574 




Glu 


Gly 


Ser 


Trp 


Pro 


Pro 


Pro 


Ser 


Gly 


Ala 


Pro 


Asp 


Ala 


Arg 


Pro 


Trp 




b 




-25 










-20 










-15 














CTG 


CCC 


TCG 


CCC 


GGC 


CGC 


CGG 


CGG 


CGG 


CGC 


ACG 


GCC 


TTC 


GCC 


AGT 


CGC 


622 




Leu 


Pro 


Ser 


Pro 


Gly 


Arg 


Arg 


Arg 


Arg 


Arg 


Thr 


Ala 


Phe 


Ala 


Ser 


Arg 






- 10 










-5 










1 








5 








CAT 


GGC 


AAG 


CGG 


CAC 


GGC 


AAG 


AAG 


TCC 


AGG 


CTA 


CGC 


TGC 


AGC 


AAG 


AAG 


e1o 




His 


Gly 


Lys 


Arg 


His 


Gly 


Lys 


Lys 


Ser 


Arg 


Leu 


Arg 


Cys 


Ser. 


Lys 


Lys 












10 










15 










2 0 










CCC 


CTG 


CAC 


GTG 


AAC 


TTC 


AAG 


GAG 


CTG 


GGC 


TGG 


GAC 


GAC 


TGG 


ATT 


ATC 


718 




Pro 


Leu 


His 


Val 


Asn 


Phe 


Lys 


Glu 


Leu 


Gly 


Trp 


Asp 


Asp 


Trp 


He 


He 










25 










30 










35 












GCG 


CCC 


CTG 


GAG 


TAC 


GAG 


GCC 


TAT 


CAC 


TGC 


GAG 


GGT 


GTA 


TGC 


GAC 


TTC 


766 




Ala 


Pro 


Leu 


Glu 


Tyr 


Glu 


Ala 


Tyr 


His 


Cys 


Glu 


Gly 


Val 


Cys 


Asp 


Phe 








40 










45 










50 














CCG 


CTG 


CGC 


TCG 


CAC 


CTG 


GAG 


CCC 


ACC 


AAC 


CAC 


GCC 


ATC 


ATC 


CAG 


ACG 


814 




Pro 


Leu 


Arg 


Ser 


His 


Leu 


Glu 


Pro 


Thr 


Asn 


His 


Ala 


He 


He 


Gin 


Thr 






55 








60 










65 










70 






CTG 


ATG 


AAC 


TCC 


ATG 


GAC 


CCC 


GGC 


TCC 


ACC 


CCG 


CCC 


AGC 


TGC 


TGC 


GTG 


862 




Leu 


Me t 


Asn 


Ser 


Met 


Asp 


Pro 


Gly 


Ser 


Thr 


Pro 


Pro 


Ser 


Cys 


Cys 


Val 














75 










80 










85 








CCC 


ACC 


AAA 


TTG 


ACT 


CCC 


ATC 


AGC 


ATT 


CTA 


TAC 


ATC 


GAC 


GCG 


GGC 


AAT 


910 




Pro 


Thr 


Lys 


Leu 


Thr 


Pro 


He 


Ser 


He 


Leu 


Tyr 


He 


Asp 


Ala 


Gly 


Asn 










90 










95 










100 
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AAT GTG GTC TAC AAG CAG TAC GAG GAC ATG GTG GTG GAG TCG TGC GGC 958 
Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly 
105 110 115 

TGC AGG TAGCGGTGCC TTTCCCGCCG CCTTGGCCCG GAACCAAGGT GGGCCAAGGT 1014 
Cys Arg 
120 

CCGCCTTGCA GGGGAGGCCT GGCTGCAGAG AGGCGGAGG A GGAAGCTGGC GCTGGGGGAG 1074 

GCTGAGGGTG AGGGAACAGC CTGGATGTGA GAGCCGGTGG GAGAGAAGGG AGCGCACCTT 1134 

CCCAGTAACT TCTACCTGCC AGCCCAGAGG GAAATAT 1171 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

ip (ii) MOLECULE TYPE: protein 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

45 Asn Ser Asp Leu Ser His Thr Pro Leu Arg Arg Gin Lys Tyr Leu Phe 

m -201 -200 -195 -190 



^ Asp Val Ser Met Leii Ser Asp Lys Glu Glu Leu Val Gly Ala Glu Leu 

(Jj -185 -180 -175 -170 

ni 

■ " Arg Leu Phe Arg Gin Ala Pro Ser Ala Pro Trp Gly Pro Pro Ala Gly 

-165 -160 -155 

m Pro Leu His Val Gin Leu Phe Pro Cys Leu Ser Pro Leu Leu Leu Asp 
-150 -145 -140 

Us 

Ala Arg Thr Leu Asp Pro Gin Gly Ala Pro Pro Ala Gly Trp Glu Val 



=frii -135 -130 -125 

^ Phe Asp Val Trp Gin Gly Leu Arg His Gin Pro Trp Lys Gin Leu Cys 
-120 -115 -110 

Leu Glu Leu Arg Ala Ala Trp Gly Glu Leu Asp Ala Gly Glu Ala Glu 
-105 -100 -95 -90 

Ala Arg Ala Arg Gly Pro Gin Gin Pro Pro Pro Pro Asp Leu Arg Ser 
-85 -80 -75 

Leu Gly Phe Gly Arg Arg Val Arg Pro Pro Gin Glu Arg Ala Leu Leu 
-70 ' -65 -60 

Val Val Phe Thr Arg Ser Gin Arg Lys Asn Leu Phe Ala Glu Met Arg 
-55 -50 -45 

Glu Gin Leu Gly Ser Ala Glu Ala Ala Gly Pro Gly Ala Gly Ala Glu 
-40 -35 -30 

Gly Ser Trp Pro Pro Pro Ser Gly Ala Pro Asp Ala Arg Pro Trp Leu 
-25 -20 -15 -10 

Pro Ser Pro Gly Arg Arg Arg Arg Arg Thr Ala Phe Ala Ser Arg His 
-5 15 

Gly Lys Arg His Gly Lys Lys Ser Arg Leu Arg Cys Ser Lys Lys Pro 
10 15 20 
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Leu His Val Asn Phe Lys Glu Leu Gly Trp Asp Asp Trp lie lie Ala 
25 30 35 

Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu Gly Val Cys Asp Phe Pro 
40 45 50 55 

Leu Arg Ser His Leu Glu Pro Thr Asn His Ala lie lie Gin Thr Leu 
60 65 70 



Met Asn Ser Met Asp Pro Gly Ser Thr Pro Pro Ser Cys Cys Val Pro 
75 80 85 

Thr Lys Leu Thr Pro lie- Ser He Leu Tyr lie Asp Ala Gly Asn Asn 
90 95 100 

Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ser Cys Gly Cys 
105 110 115 

Arg 
120 



(2) INFORMATION FOR SEQ ID NO: 27: 



n (i) SEQUENCE CHARACTERISTICS: 

M (A) LENGTH: 1233 base pairs 

'4S (B) TYPE: nucleic acid 

jj5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



mi 



hi 



4s& 



(ii) MOLECULE TYPE: DNA (genomic) 



! 1 ■ (vii) IMMEDIATE SOURCE: 

* (B) CLONE- DNA encoding BMP2 propeptide/BMP- 12 mature 

Q peptide 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1233 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 847.. 1233 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC 4 8 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
-282 -280 -275 -270 

CTC CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG 96 
Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
-265 -260 -255 

TTC GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG 14 4 

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 
-250 -245 -240 -235 

GTC CTG AGC- GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA 192 
Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
-230 -225 -220 

CAG- AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA 240 
Gin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
-215 -210 -205 
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GAC CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC 2 88 

Asp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 
-200 -195 -190 

CAC CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC 336 
His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
-185 -180 -175 

CAC CAT GAA GAA TCT TTG GAA GAA CTA -CCA GAA ACG AGT GGG AAA ACA 384 
His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
-170 -165 -160 -155 

ACC CGG AGA TTC TTC TTT -AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT 4 32 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe 
-150 -145 -140 

ATC ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT 4 80 

lie Thr Ser Ala Glu Leu qin Val Phe Arg Glu Gin Met Gin Asp Ala 
-135 -130 -125 

TTA GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC 528 

Leu Gly Asn Asn Ser Ser Phe His His Arg lie Asn lie Tyr Glu lie 
p -120 -115 -110 

^ATA AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG 576 

yJlle Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 

:; p -105 -100 -95 

in 

.r.'GAC ACC AGG TTG GTG J^T CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT 624 

^"'Asp Thr Arg Leu Val^Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp 
£y-90 -85 -80 -75 

fU 

,r "GTC ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT 672 

^ Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
!p -70 -65 -60 

GGA TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC 72 0 

! ^Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
W -55 -50 -45 



:{ _ AAG AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC 768 
^Lys Arg His Val Arg lie Ser Arg Ser Leu His Gin Asp Glu His Ser 
40 -35 -30 



TGG TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA 81*6 

Trp Ser Gin lie Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 

-25 -20 -15 

GGG CAT CCT CTC CAC AAA AGA GAA AAA CGT ACG GCG TTG GCC GGG ACG 864 

Gly His Pro Leu His Lys Arg Glu Lys Arg Thr Ala Leu Ala Gly Thr 

-10 -5 1 , 5 

CGG ACA GCG CAG GGC AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC 912 

Arg Thr Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg 

10 15 20 

AGG GGC CGG AGC CGC TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG 56 0 

Arg Gly Arg Ser Arg Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys 

25 30 35 

GAG CTC GGC TGG GAC GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG 1C=08 

Glu Leu Gly Trp Asp Asp Trp lie lie Ala Pro Leu Asp Tyr Glu Ala 

40 45 50 

TAC CAC TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG 1 056 

Tvr His Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu 

55 60 65 70 
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# 



CCC ACC AAC CAT GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG GCA CCA 1104 
Pro Thr Asn His Ala lie lie Gin Thr Leu. Leu Asn Ser Met Ala Pro 
75 80 85 

GAC GCG GCG CCG GCC TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC 1152 
Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie 
90 95 100 

AGC ATC CTC TAC ATC GAC GCC GCC AAC -AAC GTT - GTC • TAC AAG CAA TAC 12 00 

Ser lie Leu Tyr lie Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr 
105 110 115 

GAG GAC ATG GTG GTG GAG- -GCC TGC GGC TGC AGG 123 3 

Glu Asp Met Val Val Glu- Ala Cys Gly Cys Arg 
120 125 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 
; p (D) TOPOLOGY: linear 

€l (ii) MOLECULE TYPE: protein 

x; (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

M Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
t** -282 -280 ^ -275 -270 

63 

;f~~ ? Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
^ -265 -260 -255 

pj Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 

-250 -245 -240 -235 

ly Val Leu Ser Glu Phe Glu Leu Arg. Leu Leu Ser Met Phe Gly Leu Lys 

-230 -225 -220 

•M* Gin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
M< -215 -210 -205 

Asp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 
-200 -195 -190 

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
-185 -180 -175 

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
-170 -165 -160 -155 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe 
-150 -145 -140 

lie Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala 
-135 -130 -125 

Leu Gly Asn Asn Ser Ser Phe His His Arg lie Asn lie Tyr Glu lie 
-120 -115 -110 

lie Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
-105 -100 -95 

Asp Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp 
-90 -85 -80 -75 
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Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
-70 -65 -60 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
-55 -50 -45 

Lys Arg His Val Arg lie Ser Arg Ser Leu His Gin Asp Glu His Ser 
-40 -35 -30 

Trp Ser Gin lie Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
-25 -20 -15 

Gly His Pro Leu His Lys Arg Glu Lys Arg Thr Ala Leu Ala Gly Thr 
-10 -5 15 

Arg Thr Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg 
10 15 20 

Arg Gly Arg Ser Arg Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys 
25 30 35 

Glu Leu Gly Trp Asp Asp Trp lie lie Ala Pro Leu Asp Tyr Glu Ala 
C3 40 45 50 

Tyr His Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu 
! S 55 60 65 70 

Sri Pro Thr Asn His Ala lie lie Gin Thr Leu Leu Asn Ser Met Ala Pro 
J\ 75 _ 80 85 

ipj Asp Ala Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie 

jfljj 90 95 100 

Ser lie Leu Tyr lie Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr 
p 105 ' 110 115 

T* a Glu Asp Met Val Val Glu Ala Cys Gly Cys Arg 
120 125 

ip (2) INFORMATION FOR SEQ ID NO: 29: 

f* (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: murine MV1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . . 721 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

A AAG TTC TGC CTG GTG CTG GNG NCG GTG ACG GCC TCG GAG AGC AGN 4 6 

Lys Phe Cys Leu Val Leu X01 X02 Val Thr Ala Ser Glu Ser X03 
1 5 10 15 

CNG CTG GCC CTG AGA CGA CTG GGC TTC GGC TGN CCG GGC GGT GGC GAC 94 
X04 Leu Ala Leu Arg Arg Leu Gly Phe Gly X05 Pro Gly Gly Gly Asp 
20 25 30 
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6. 



GGC 
Gly 


GGC 
Gly 


GGC 
Gly 


ACT 
Thr 
35 


GCG 
Ala 


GNC 
X06 


GAG 
Glu 


GAG 
Glu 


CGC 
Arg 
40 


GCG 
Ala 


CTG 
Leu 


TTG 
Leu 


GTG 
Val 


ATC 
He 
45 


TCC 
Ser 


TCC 
Ser 


142 


r 


PPT 

Arg 


Thr 


PR TV 

Gin 
50 


app 

AGo 

Arg 


A A TV 

AAA 

Lys 


PAP 

Glu 


APT 1 

Ser 


PT»P 

Leu 
55 


TTC 
Phe 


CGG 
Arg 


GAG 
Glu 


ATC CGA GCC 
He Arg Ala 
60 


CAG 
Gin 


GCC 
Ala 


190 




Arg 


vjV- 1 

Ala 
65 


prpp 

Leu 


ppp 
Arg 


ppp 
GC(- 

Ala 


PPT 
Old 

Ala 


GCA 
Ala 
70 


GAG 
Glu 


CCG 
Pro 


CCA 
Pro 


CCG 
Pro 


GAT CCA GGA 
Asp Pro Gly 
75 


CCA 
Pro 


GGC 
Gly 


238 




Ala 
80 


cicin 
Gly 


TPR 

Ser 


ppp 

*_Gt_ 

Arg 


AAA 

AAA 

Lys 


ppp 

Ala 
85 


-AAC 
Asn 


CTG 
Leu 


GGC GGT CGC 
Gly Gly Arg 
90 


AGG 
Arg 


CGG 
Arg 


CAG 
Gin 


CGG 
Arg 


ACT 
Thr 
95 


286 . 


r 

i 


GCG 
Ala 


CTG 
Leu 


GCT 
Ala 


GGG 
Gly 


ACT 
Thr 
100 


CGG 
Arg 


GGA GNG 
qly X07 


NAG 
X08 


GGA AGC GGT GGT GGC 
Gly Ser Gly Gly Gly 
105 


GGC GGT 
Gly Gly 
110 


1 1 A 




err* 
Gly 


PPT 

Gly 


ppp 
Gly 


ppp 

Gly 
115 


ppp 
Gly 


ppp 
Gly 


GGC GGC 
Gly Gly 


GGC GGC GGC GGC GGC GGC 
Gly Gly Gly Gly Gly Gly 
120 125 


GGC 
Gly 


GCA 
Ala 


382 


r 

i 

i 


b.! ippp 


7\ pp 

Arg 


ppp 

Gly 
130 


CAC 
His 


GGG 
Gly 


CGC 
Arg 


AGA 
Arg 


GGC 
Gly 
135 


CGG 
Arg 


AGC CGC 
Ser Arg 


TGC GGT CGC 
Cys Gly Arg 
140 


AAG 
Lys 


TCA 
Ser 


430 




H 

m 


par 1 
His 
145 


Val 


pnp 
Asp 


TTT 
Phe" 


A A P 

^Lys 


GAG 
Glu 
150 


CTG 
Leu 


GGC TGG GAC GAC TGG 
Gly Trp Asp Asp Trp 
155 


ATC 
He 


ATC 
He 


GCG 
Ala 


478 




it UiPPA 

3 Pro 
f=|16 0 


r T ,r P7\ 

Leu 


P A P 

GAC 
Asp 


TAC 
Tyr 


GAG 
Glu 


GCA 
Ala 
165 


TAC 
Tyr 


CAC 
His 


TGC 
Cys 


GAG GGC GTT 
Glu Gly Val 
170 


TGC 
Cys 


GAC 
Asp 


TTT 
Phe 


CCT 
Pro 
175 


526 


r 

i 


COpTr* 
UJLeu 


ppp 
Arg 


Ser 


PAP 

His 


CTG 
Leu 
180 


GAG 
Glu 


CCT 
Pro 


ACC 
Thr 


AAC 
Asn 


CAC 
His 
185 


GCC 
Ala 


ATC 
He 


ATT 
He 


CAG 
Gin 


ACG 
Thr 
190 


CTG 
Leu 


574 




QcTC 


AAC 
Asn 


TCC 
Ser 


ATG 
Met 
195 


GCG 
Ala 


CCC 
Pro 


GAC 
Asp 


GCT 
Ala 


GCG 
Ala 
200 


CCA 
Pro 


GCC 
Ala 


TCC 
Ser 


TGC 
Cys 


TGC 
Cys 
205 


GTG 
Val 


CCC 
Pro 


622 




GCA 
Ala 


AGG 
Arg 


CTC 
Leu 
210 


AGT 
Ser 


CCC 
Pro 


ATC 
lie 


AGC 
Ser 


ATT 
lie 
215 


CTC 
Leu 


TAC 
Tyr 


ATC 
He 


GAT 
Asp 


GCC 
Ala 
220 


GCC 
Ala 


AAC 
Asn 


AAC 
Asn 


670 




GTG 
Val 


GTC 
Val 
225 


TAC 
Tyr 


AAG 
Lys 


CAG 
Gin 


TAC 
Tyr 


GAA 
Glu 
230 


GAC 
Asp 


ATG 
Met 


GTG 
Val 


GTG 
Val 


GAG 
Glu 
235 


GCC 
Ala 


TGC 
Cys 


GGC 
Gly 


TGC 
Cys 


718 





AGG TAGCATGCGG TCTGGGGAGG GTCTGGCCGC CCAGGACCCT AGCTCAAGAG 


771 


Arg 














240 














CAGGTGTCAT 


CAGGCCCGAG 


GGACGGCGGA 


CTATGGCCTC 


TGCCAGCACA 


G AGG AG AGC A 


831 


CACAGTTAAC 


ACTCACATTT 


ACACACTCCT 


TCACTCACGC 


ACATGTTTAC 


CGTGGACGGC 


891 


AGGCGCTAAA 


AGCCTTGCTT 


ATTTGCTACC 


ATTGATACAA 


ACCTCTGTCC 


TTTTCGGGAG 


951 


AGGGAAGGGC 


ATCTGTGTTT 


ATGTTGCAGT 


AATTGGCACT 


AAATCCAAGT 


AGAAATGGGT 


1011 


TAGCATTGGA 


TTCTCCTTTT 


AGTTGGAGGC 


GGTGTGGCTG 


GATTCCTGAC 


GTTGGATATG 


1071 


GAG TGC A CTG 


CAGGGCTGGG 


ATACCCAGAT 


TCTCTGGAGT 


GGGCATTGGG 


AACCTTCAAA 


1131 
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AGTAAGGAGC CACTGGGGCT TGGGAGGGAG CACCCGGTTC CTAAACAAGT CTGATGTGTA 1191 
CTGCTCAGTT TG 12 03 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



Lys 
1 


Phe 


Cys 


Leu 


Val 
5 


Leu 


X01 


X02 


Val 


Thr 
10 


Ala 


Ser 


Glu 


Ser 


X03 
15 


X04 


Leu 


Ala 


Leu 


Arg 
20 


Arg 


Leu 


Gly 


Phe 


Gly X05 
25 


Pro 


Gly 


Gly Gly Asp Gly 
30 


Thr 

'T~ 


Gly 

Gin 

50 


Thr 
35 

Arg 


Ala 
Lys 


X06 
Glu 


Glu 
Ser 


Glu 

Leu 
55 


Arg 
40 

Phe 


Ala 
Arg 


Leu 
Glu 


Leu 
He 


Val 

Arg 
60 


He 
45 

Ala 


Ser 
Gin 


Ser 
Ala 


Arg 
Arg 


Ut Ala 


Leu 


Arg 


Ala 


Ala Ala 

^"10 


Glu 


Pro 


Pro 


Pro 


Asp 
75 


Pro 


Gly 


Pro 


Gly Ala 
80 


K Gly 


Ser 


Arg 


Lys 


Ala 
85 


Asn 


Leu 


Gly 


Gly Arg 
90 


Arg 


Arg 


Gin 


Arg 


Thr 
95 


Ala 


3 

fsa Leu 

SI 


Ala 


Gly 


Thr 
100 


Arg 


Gly 


X07 


X08 


Gly 
105 


Ser 


Gly 


Gly 


Gly Gly 
110 


Gly 


Gly 


W Gly 

3 i 
3"""""' 


Gly 


Gly 
115 


Gly 


Gly 


Gly 


Gly 


Gly 
120 


Gly 


Gly 


Gly 


Gly 


Gly 
125 


Gly 


Ala 


Gly 


W Arg 


Gly 
130 


His 


Gly 


Arg 


Arg 


Gly 
135 


Arg 


Ser 


Arg 


Cys 


Gly 
140 


Arg 


Lys 


Ser 


Leu 


His 
145 


Val 


Asp 


Phe 


Lys 


Glu 
150 


Leu 


Gly 


Trp 


Asp 


Asp 
155 


Trp 


He 


He 


Ala 


Pro 
160 


Leu 


Asp 


Tyr 


Glu 


Ala 
165 


Tyr 


His 


Cys 


Glu 


Gly 
170 


Val 


Cys 


Asp 


Phe 


Pro 
175 


Leu 


Arg 


Ser 


His 


Leu 
180 


Glu 


Pro 


Thr 


Asn 


His 
185 


Ala 


He 


He 


Gin 


Thr 
190 


Leu 


Leu 


Asn 


Ser 


Met 
195 


Ala 


Pro 


Asp 


Ala 


Ala 
200 


Pro 


Ala 


Ser 


Cys 


Cys 
205 


Val 


Pro 


Ala 


Arg 


Leu 
210 


Ser 


Pro 


lie 


Ser 


He 
215 


Leu 


Tyr 


He 


Asp 


Ala 
220 


Ala 


Asn 


Asn 


Val 


Val 
225 


Tyr 


Lys 


Gin 


Tyr 


Glu 
230 


Asp 


Met 


Val 


Val 


Glu 
235 


Ala 


Cys 


Gly 


Cys 


Arg 
240 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1046 base pairs 

(B) TYPE : nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(ivj ANTI- SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: MURINE MV2 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 7 90 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

A AGA AAA CAA GCT TGC ATT CCT GCA GGT CCG ACT CTA AGA GGA TCC 4 6 

Arg Lys Gin Ala Cys lie Pro Ala Gly Pro Thr Leu Arg Gly Ser 
^ 15 10 15 

u 

*sp TCA GGG ACC CAA CCC AGG CCG GCT GGG AAG TCT TTC GAC GTG TGG CAG 94 

Ul Ser Gly Thr Gin Pro Arg Pro Ala Gly Lys Ser Phe Asp Val Trp Gin 
% 20 25 30 

*h 

W GGC CTG CGC CCT CAG CCT TGG AAG CAG CTG TGC CTG GAG TTG CGG GCA 14 2 

|4 Gly Leu Arg Pro Glnr^Pro Trp Lys Gin Leu Cys Leu Glu Leu Arg Ala 

fU GCC TGG GGT GAG CTG GAC RCC GGG GAT ACG GGG GCG CGC GCG AGG GGT 190 
* Ala Trp Gly Glu Leu Asp X01 Gly Asp Thr Gly Ala Arg Ala Arg Gly 
50 55 60 

03 CCC CAG CAG CCA CCG CCT CTG GAC CTG CGG AGT CTG GGC TTC GGT CGG 23 8 

|y Pro Gin Gin Pro Pro Pro Leu Asp Leu Arg Ser Leu Gly Phe Gly Arg 

© AGG GTG AGA CCG CCC CAG GAG CGC GCC CTG CTT GTA GTG TTC ACC AGA 286 
|# Arg Val Arg Pro Pro Gin Glu Arg Ala Leu Leu Val Val Phe Thr Arg 
80 85 90 95 

TCG CAG CGC AAG AAC CTG TTC ACT GAG ATG CAT GAG CAG CTG GGC TCT 3 3,4 

Ser Gin Arg Lys Asn Leu Phe Thr Glu Met His Glu Gin Leu Gly Ser 
100 105 110 

GCA GAG GCT GCG GGA GCC GAG GGG TCA TGT CCA GCG CCG TCG GGC TCC 38 2 

Ala Glu Ala Ala Gly Ala Glu Gly Ser Cys Pro Ala Pro Ser Gly Ser 
115 120 125 

CCA GAC ACC GGG TCT TGG CTG CCC TCG CCC GGC CGC CGG CGG CGA CGC 430 
Pro Asp Thr Gly Ser Trp Leu Pro Ser Pro Gly Arg Arg Arg Arg Arg 
130 135 140 

ACC GCC TTC GCC AGC CGT CAC GGC AAG CGA CAT GGC AAG AAG TCC AGG 4 78 

Thr Ala Phe Ala Ser Arg His Gly Lys Arg His Gly Lys Lys Ser Arg 
145 150 155 

CTG CGC TGC AGC AGA AAG CCT CTG CAC GTG AAT TTT AAG GAG TTA GGC 52 6 

Leu Ara Cys Ser Arg Lys Pro Leu His Val Asn Phe Lys Glu Leu Gly 
160 " 165 170 175 

TGG GAC GAC TGG ATT ATC GCG CCC CTA GAG TAC GAG GCC TAT CAC TGC 574 
Trp Asp Asp Trp He He Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys 
180 185 190 
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GAG GGC GTG TGC GAC TTT CCG CTG CGC TCG CAC CTT GAG CCC ACT AAC 622 
Glu Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn 
195 200 205 

CAT GCC ATC ATT CAG ACG CTG ATG AAC TCC ATG GAC CCG GGC TCC ACC 670 
His Ala lie lie Gin Thr Leu Met Asn Ser Met Asp Pro Gly Ser Thr 
210 215 220 

CCG CCT AGC TGC TGC GTT CCC ACC AAA CTG .ACT . CCC . ATT. AGC ATC CTG 718 
Pro Pro Ser Cys Cys Val Pro Thr Lys Leu Thr Pro lie Ser lie Leu 
225 230 235 

TAC ATC GAC GCG GGC AAT AAT GTN GTC TAC AAG CAG TAT GAG GAC ATG 766 
Tyr He Asp Ala Gly Asn Asn X02 Val Tyr Lys Gin Tyr Glu Asp Met 
240 245 250 255 

GTG GTG GAG TCC TGC GGC TGT AGG TAGCGGTGCT GTCCCGCCAC CTGGGCCAGG 82 0 

Val Val Glu Ser Cys Gly Cys Arg 
260 

GACCATGGAG GGAGGCCTGA CTGCCGAGAA AGGAGCAGGA GCTGGCCTTG GAAGAGGCCA 8 80 

CAGGTGGGGG ACAGCCTGAA AGTAGGAGCA CAGTAAGAAG CAGCCCAGCC TTCCCAGAAC 94 0 

S3 

J3 CTTCCAATCC CCCAACCCAG AAGCAGCTAA GGGGTTTCAC AACTTTTGGC CTTGCCAGCC 10 00 

TGGAAAGACT AGACAAGAGG GATTCTTCTC TTTTTATTAT GGCTTG 104 6 



(2) INFORMATION FOR^SEQ ID NO:32: 



m 

J: (i) SEQUENCE CHARACTERISTICS: 
SU (A) LENGTH: 263 amino acids 

a (B) TYPE: amino acid 

pr ? (D) TOPOLOGY:, linear 

W- (ii) MOLECULE TYPE: protein 

W 

£T (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

O Arg Lys Gin Ala Cys He Pro Ala Gly Pro Thr Leu Arg Gly Ser Ser 

jjss, 15 10 15 

Gly Thr Gin Pro Arg Pro Ala Gly Lys Ser Phe Asp Val Trp Gin Gly 
20 25 30 

Leu Arg Pro Gin Pro Trp Lys Gin Leu Cys Leu Glu Leu Arg Ala Ala 
35 ' 40 45 

Trp Gly Glu Leu Asp X01 Gly Asp Thr Gly Ala Arg Ala Arg Gly Pro 
50 55 60 

Gin Gin Pro Pro Pro Leu Asp Leu Arg Ser Leu Gly Phe Gly Arg Arg 
65 70 75 80 

Val Arg Pro Pro Gin Glu Arg Ala Leu Leu Val Val Phe Thr Arg Ser 
85 90 95 

Gin Arg Lys Asn Leu Phe Thr Glu Met His Glu Gin Leu Gly Ser Ala 
100 105 110 

Glu Ala Ala Gly Ala Glu Gly Ser Cys Pro Ala Pro Ser Gly Ser Pre 
115 120 125 

Aso Thr Gly Ser Trp Leu Pro Ser Pro Gly Arc Arg Arg Arg Arg Thr 
130 135 140 
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Ala Phe Ala Ser Arg His Gly Lys Arg His Gly Lys Lys Ser Arg Leu 
145 150 155 160 

Arg Cys Ser Arg Lys Pro Leu His Val Asn Phe Lys Glu Leu Gly Trp 
165 170 175 

Asp Asp Trp lie He Ala Pro Leu Glu Tyr Glu Ala Tyr His Cys Glu 
180 185 190 

Gly Val Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His 
195 200 205 

Ala He He Gin Thr Leu ..Met Asn Ser Met Asp Pro Gly Ser Thr Pro 
210 215 220 

Pro Ser Cys Cys Val Pro Thr Lys Leu Thr Pro He Ser He Leu Tyr 
225 230 235 240 

He Asp Ala Gly Asn Asn Xt)2 Val Tyr Lys Gin Tyr Glu Asp Met Val 
245 250 255 



Val Glu Ser Cys Gly Cys Arg 
260 

(2) INFORMATION FOR SEQ ID NO: 33: 



In 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 5 base pairs 

(B) TYPE: nucleic acid 
U, (C) STRANBtfDNESS : single 
ff§ (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



3 

(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE : NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: HUMAN VI- 1 

<ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 138.. 1301 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 990.. 1301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

AACTATAGCA CCTGCAGTCC CTGGTCTTGG GTGTAGGGGT GCGCTCCTGG TCCCGCGGCT 6 0 

C.-.GGGATATG CAGTGACCAA TGGGTTGTTG GCCTGATGGG ACTTTTGGCT TGCTAAACCA 12 0 

AASCTCGGTT CGGATAG CCC GGG CGA AGA CGT CCG CTG CTC TGG GCC AGG 170 

Pro Gly Arg Arg Arg Pro Leu Leu Trp Ala Arg 
-284 -280 -275 

CTC- GCA GCG TTC AGG CTG GGG CAG AGA CGC GGA GTC GGG CGC TGG CTC 218 
Le-- Ala Ala Phe Arg Leu Gly Gin Arg Arg Gly Val Gly Arg Trp Leu 
-270 -265 -260 
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CAA CAG GCC TGG CTC CCA CAT CGA AGA CAG CTG GGC CAT TTG CTG TTA 266 

Gin Gin Ala Trp Leu Pro His Arg Arg Gin Leu Gly His Leu Leu Leu 
-255 -250 -245 

GGA GGC CCC GCG CTG ACA GTG TGC AGG ATT TGC TCT TAC AC A GCT CTT 314 

Gly Gly Pro Ala Leu Thr Val Cys Arg lie Cys Ser Tyr Thr Ala Leu 
-240 -235 -230 

TCT CTC TGT CCC TGC CGG TCC CCC GCA.. GAC. GAA TCG .GCA GCC GAA ACA 362 

Ser Leu Cys Pro Cys Arg Ser Pro Ala Asp Glu Ser Ala Ala Glu Thr 
-225 -220 -215 -210 

GGC CAG AGC TTC CTG TTC GAC GTG TCC AGC CTT AAC GAC GCA GAC GAG 410 

Gly Gin Ser Phe Leu Phe Asp Val Ser Ser Leu Asn Asp Ala Asp Glu 

-205 -200 -195 

GTG GTG GGT GCC GAG CTG CGC GTG CTG CGC CGG GGA TCT CCA GAG TCG 4 58 

Val Val Gly Ala Glu Leu Arg Val Leu Arg Arg Gly Ser Pro Glu Ser 
-190 " v -185 -180 

GGC CCA GGC AGC TGG ACT TCT CCG CCG TTG CTG CTG CTG TCC ACG TGC 506 

Gly Pro Gly Ser Trp Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys 
-175 -170 -165 



CCG GGC GCC GCC CGA GCG CCA CGC CTG CTG TAC TCG CGG GCA GCT GAG 5 54 

: ,fii Pro Gly Ala Ala Arg Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu 
^1 -160 -155 -150 

IH CCC CTA GTC GGT CAG CGC TGG GAG GCG TTC GAC GTG GCG GAC GCC ATG 6 02 

^ Pro Leu Val Gly Gln<Arg Trp Glu Ala Phe Asp Val Ala Asp Ala Met 

P "145 -140 -135 -130 

SV AGG CGC CAC CGT CGT GAA CCG CGC CCC CCC CGC GCG TTC TGC CTC TTG 650 

? Arg Arg His Arg Arg Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu 

-125 -120 -115 

® CTG CGC GCA GTG GCA GGC CCG GTG CCG AGC CCG TTG GCA CTG CGG CGA 6 98 

jjijj Leu Arg Ala Val Ala Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg 
n.' - 110 ~ 105 -100 

Q CTG GGC TTC GGC TGG CCG GGC GGA GGG GGC TCT GCG GCA GAG GAG CGC 74 6 

jU, Leu Gly Phe Gly Trp Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg 
-95 -90 -85 

GCG GTG CTA GTC GTC TCC TCC CGC ACG CAG AGG AAA GAG AGC TTA TTC 7 9A 

Ala Val Leu Val Val Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe 
-80 -75 -70 

CGG GAG ATC CGC GCC CAG GCC CGC GCG CTC GGG GCC GCT CTG GCC TCA 84 2 

Arg Glu lie Arg Ala Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser 

-65 -60 -55 -50 

GAG CCG CTG CCC GAC CCA GGA ACC GGC ACC GCG TCG CCA AGG GCA GTC 8 90 

Glu Pro Leu Pro Asp Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val 
-45 -40 -35 

ATT GGC GGC CGC AGA CGG AGG AGG ACG GCG TTG GCC GGG ACG CGG ACA 938 

lie Gly Gly Arg Arg Arg Arg Arg Thr Ala Leu Ala Gly Thr Arg Thr 
-30 -25 ' -20 

GCG CAG GGC AGC GGC GGG GGC GCG GGC CGG GGC CAC GGG CGC AGG GGC 986 

Ala Gin Gly Ser Gly Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly 
-15 -10 -5 

CGG AG C CGC TGC AGC CGC AAG CCG TTG CAC GTG GAC TTC AAG GAG CTC IC34 

Arc Ser Arg Cys Ser Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu 

15 10 IS 
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• 



GGC TGG GAC GAC TGG ATC ATC GCG CCG CTG GAC TAC GAG GCG TAC CAC 1082 
Gly Trp Asp Asp Trp lie lie Ala Pro Leu Asp Tyr Glu Ala Tyr His 
20 25 30 

TGC GAG GGC CTT TGC GAC TTC CCT TTG CGT TCG CAC CTC GAG CCC ACC 1130 
Cys Glu Gly Leu Cys Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr 
35 40 45 

AAC CAT GCC ATC ATT CAG ACG CTG CTC AAC TCC ATG ; GCA CCA GAC GCG 117 8 

Asn His Ala lie lie Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala 
50 55 60 

GCG CCG GCC TCC TGC TGT GTG CCA GCG CGC CTC AGC CCC ATC AGC ATC 1226 
Ala Pro Ala Ser Cys Cys Val Pro Ala Arg Leu Ser Pro lie Ser lie 
65 70 75 

CTC TAC ATC GAC GCC GCC AAC AAC GTT GTC TAC AAG CAA TAC GAG GAC 12 74 

Leu Tyr lie Asp Ala Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp 
80 85 ^ 90 95 

ATG GTG GTG GAG GCC TGC GGC TGC AGG TAGCGCGCGG GCCGGGGAGG 13 21 

Met Val Val Glu Ala Cys Gly Cys Arg 
100 

o 

yp GGGCAGCCAC GCGGCCGAGG ATCC 134 5 

<£ 

r 

u 



4* (2) INFORMATION FOR SEQ ID NO: 34: 



( i ) SEQUENCER-CHARACTERISTICS : 



(A) LENGTH: 38 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



0=P (B) TYPE: amino acid 



^ (ii) MOLECULE TYPE: protein 

u 

03 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

y 

a T r Pro Gly Arg Arg Arg Pro Leu Leu Trp Ala Arg Leu Ala Ala Phe Arg 
S"* -284 -280 -275 -270 

O 

Leu Gly Gin Arg Arg Gly Val Gly Arg Trp Leu Gin Gin Ala Trp Leu 
! < -265 -260 -255 

Pro His Arg Arg Gin Leu Gly His Leu Leu Leu Gly Gly Pro Ala Leu 
-250 -245 -240 

Thr Val Cys Arg lie Cys Ser Tyr Thr Ala Leu Ser Leu Cys Pro Cys 
-235 -230 -225 

Arg Ser Pro Ala Asp Glu Ser Ala Ala Glu Thr Gly Gin Ser Phe Leu 
-220 -215 -210 -205 

Phe Asp Val Ser Ser Leu Asn Asp Ala Asp Glu Val Val Gly Ala Glu 
-200 -195 -190 

Leu Arg Val Leu Arg Arg Gly Ser Pro Glu Ser Gly Pro Gly Ser Trp 
-185 -180 -175 

Thr Ser Pro Pro Leu Leu Leu Leu Ser Thr Cys Pro Gly Ala Ala Arg 
-170 -165 -160 

Ala Pro Arg Leu Leu Tyr Ser Arg Ala Ala Glu Pro Leu Val Gly Gin 
-155 -150 ..-145 

Arg Tro Glu Ala Phe Asp Val Ala Asp Ala Met Arg Arg His Arg Arg 
-140 " -135 -130 -125 
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Glu Pro Arg Pro Pro Arg Ala Phe Cys Leu Leu Leu Arg Ala Val Ala 
-120 -115 -110 

Gly Pro Val Pro Ser Pro Leu Ala Leu Arg Arg Leu Gly Phe Gly Trp 
-105 -100 -95 

Pro Gly Gly Gly Gly Ser Ala Ala Glu Glu Arg Ala Val Leu Val Val 

-90 -85 -80 



Ser Ser Arg Thr Gin Arg Lys Glu Ser Leu Phe Arg Glu lie Arg Ala 
-75 -70 -65 

Gin Ala Arg Ala Leu Gly Ala Ala Leu Ala Ser Glu Pro Leu Pro Asp 
-60 -55' -50 -45 

Pro Gly Thr Gly Thr Ala Ser Pro Arg Ala Val lie Gly Gly Arg Arg 
-40 -35 -30 

Arg Arg Arg Thr Ala Leu Aria Gly Thr Arg Thr Ala Gin Gly Ser Gly 
-25 -20 -15 

Gly Gly Ala Gly Arg Gly His Gly Arg Arg Gly Arg Ser Arg Cys Ser 
-10 -5 1 

•tOtS, 

Arg Lys Pro Leu His Val Asp Phe Lys Glu Leu Gly Trp Asp Asp Trp 
5 10 15 20 

4= He He Ala Pro Leu Asp Tyr Glu Ala Tyr His Cys Glu Gly Leu Cys 
jj| 25 30 35 

^ Asp Phe Pro Leu Arg Ser His Leu Glu Pro Thr Asn His Ala He He 
Ui 40 45 50 

fy 

^ Gin Thr Leu Leu Asn Ser Met Ala Pro Asp Ala Ala Pro Ala Ser Cys 
L 55 60 65 

ffii Cys v al Pro Ala Arg Leu Ser Pro He Ser He Leu Tyr He Asp Ala 

M 1 Ala Asn Asn Val Val Tyr Lys Gin Tyr Glu Asp Met Val Val Glu Ala 
Q 85 .90 95 100 

^ Cys Gly Cys Arg 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE : 

'(C) INDIVIDUAL ISOLATE: primer number 8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 



7G7ATGCGAC TTCCCGC 



